Similarity Search for Multi-dimensional NMR-Spectra of Natural Products
نویسندگان
چکیده
Searching and mining nuclear magnetic resonance (NMR)spectra of naturally occurring products is an important task to investigate new potentially useful chemical compounds. We develop a set-based similarity function, which, however, does not sufficiently capture more abstract aspects of similarity. NMR-spectra are like documents, but consists of continuous multi-dimensional points instead of words. Probabilistic semantic indexing (PLSI) is an retrieval method, which learns hidden topics. We develop several mappings from continuous NMR-spectra to discrete text-like data. The new mappings include redundancies into the discrete data, which proofs helpful for the PLSI-model used afterwards. Our experiments show that PLSI, which is designed for text data created by humans, can effectively handle the mapped NMR-data originating from natural products. Additionally, PLSI combined with the new mappings is able to find meaningful ”topics” in the NMR-data.
منابع مشابه
An Evaluation of Text Retrieval Methods for Similarity Search of multi-dimensional NMR-Spectra
Searching and mining nuclear magnetic resonance (NMR)spectra of naturally occurring substances is an important task to investigate new potentially useful chemical compounds. Multi-dimensional NMR-spectra are relational objects like documents, but consists of continuous multi-dimensional points called peaks instead of words. We develop several mappings from continuous NMR-spectra to discrete tex...
متن کاملMedian Modified Wiener Filter for nonlinear adaptive spatial denoising of protein NMR multidimensional spectra
Denoising multidimensional NMR-spectra is a fundamental step in NMR protein structure determination. The state-of-the-art method uses wavelet-denoising, which may suffer when applied to non-stationary signals affected by Gaussian-white-noise mixed with strong impulsive artifacts, like those in multi-dimensional NMR-spectra. Regrettably, Wavelet's performance depends on a combinatorial search of...
متن کاملEffects of Clinacanthus nutans leaf extract on lipopolysaccharide -induced neuroinflammation in rats: A behavioral and 1H NMR-based metabolomics study
Objective: This research revealed the biochemical outcomes of metabolic dysregulation in serum associated with physiological sickness behavior following lipopolysaccharide (LPS)-induced neuroinflammation in rats, and treatment with Clinacanthus nutans (CN). Verification of 1H NMR analysis of the CN aqueous extract proved the existence of bioactive phytochemical constituents’ in extract. Materia...
متن کاملChoosing the best pulse sequences, acquisition parameters, postacquisition processing strategies, and probes for natural product structure elucidation by NMR spectroscopy.
The relative merits of different pairs of two-dimensional NMR pulse sequences (COSY-90 vs COSY-45, NOESY vs T-ROESY, HSQC vs HMQC, HMBC vs CIGAR, etc.) are compared and recommendations are made for the preferred choice of sequences for natural product structure elucidation. Similar comparisons are made between different selective 1D sequences and the corresponding 2D sequences. Many users of 2D...
متن کاملHPLC-SPE-NMR: a productivity tool in natural products research
Natural products provide excellent potential leads for drug development because of their chemical diversity and biological functionality. However, the productivity of discovery of new, pharmacologically active natural products has traditionally been low due to inherent difficulties and costs associated with extract dereplication, i.e., isolation, purification and structure elucidation of indivi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006